Search CORE

55 research outputs found

L’apport des corpus échantillonnés aux descriptions grammaticales. L’exemple des formes contre et entre

Author: Bilger Mireille
Cappeau Paul
Publication venue: University of Bern
Publication date: 01/01/2016
Field of study

The following article sets out to show how different kinds of sampled corpus data can be used to provide a critical view of certain descriptions given in reference works (grammars and dictionaries) which, for French, tend to concentrate on literary usage. We show how quantitative and qualitative methods can be proposed to nuance the descriptions, using the example of the forms contre and entre in four different corpora: private speech, public speech, literature, press

Directory of Open Access Journals

BOP Serials

Que peuvent nous apprendre en syntaxe des corpus oraux « anciens » ?

Author: Cappeau Paul
Publication venue: 'OpenEdition'
Publication date: 14/02/2011
Field of study

Comment peut-on s’appuyer sur des corpus oraux anciens pour détecter certaines variations en train d’advenir dans la morphosyntaxe du français parlé ? Le présent article revient sur la notion d’ancien appliqué à l’oral, et présente quelques possibles changements dans les domaines du lexique et de la morphosyntaxe. La description s’appuie sur des enregistrements de la Phonothèque (1958-1979) qui sont comparés avec des corpus oraux plus récents.How is it possible to lean on old oral corpora to identify current changes in the morphosyntax of spoken French? What does the term “old” mean when applied to oral corpora? What kind of work is it possible to do out of them? This article examines some potential lexical and morphosyntactic changes which appeared recently. Two kinds of corpora are used: the Phonothèque corpora (registered between 1958 and 1970) and some contemporary spoken corpora

OpenEdition

Ce que l’oral nous a appris sur la syntaxe du français

Author: Cappeau Paul
Publication venue: 'OpenEdition'
Publication date: 27/03/2019
Field of study

Le thème de ce numéro offre un bon prétexte pour dresser, sur quelques points, un bilan de l’intérêt de l’oral pour une meilleure connaissance de la syntaxe du français. Travailler à partir de données orales impose un certain nombre de contraintes, qu’il est peut-être utile de rappeler (en partie du moins). Il est, par exemple, indispensable de disposer de corpus fiables : ce qui induit que leur constitution a été préparée (il ne suffit pas de collecter des données au hasard pour constituer u..

OpenEdition

Comment les données de corpus pourraient renouveler les manuels de grammaire ?

Author: Bilger Mireille
Cappeau Paul
Publication venue: 'OpenEdition'
Publication date: 19/11/2015
Field of study

L’exploitation de grands corpus a permis de donner un éclairage neuf sur de nombreux faits de langue et d’en renouveler à la fois l’analyse et la présentation. Ce constat est indéniable notamment pour l’anglais, mais en ce qui concerne la langue française ces types de travaux sont encore quasi inexistants ou bien trop parcellaires. L’objet de cet article est de montrer comment les données de corpus conduisent à adopter un regard critique sur certaines descriptions proposées dans les grammaires ou les manuels et à indiquer quels aménagements pourraient être apportés.The use of large corpora has brought a fresh understanding of many aspects of language and has brought renewed methods of analysis and presentation. Although this is particularly true of English, for French this type of approach is typically lacking or absent even. The aim of this article is to show how corpora can help us to develop a critical account of certain descriptions given in grammar books or manuals and can help suggest changes to these

OpenEdition

Partition et topicalisation : il y en a « stabilisateur » de sujets et de topiques indéfinis

Author: Cappeau Paul
Deulofeu José
Publication venue: Cahiers de praxématique
Publication date: 27/07/2009
Field of study

L’article traite des contraintes portant sur les pronoms indéfinis en position sujet. L’étude est menée à partir de corpus appartenant à divers registres. On contraste les emplois de sujets purs avec ceux où l’indéfini est présenté par les structures il y en a… qui, il y en a … ils. On propose d’analyser ces structures comme des stabilisateurs de la relation sujet. Leur effet est de libérer les indéfinis des contraintes microsyntaxiques qui grouvernent leur apparition en position sujet, notamment les traits sémantiques du verbe. Les deux constructions stabilisatrices ont cependant chacune leurs conditions propres d’emploi et ne constituent pas des variantes.This paper deals with the types of constraints bearing on indefinite pronouns in subject position in French. This study is based on the two distinct structures in which such pronouns appear is analysed: subject pronouns on their own versus presentative structures such as: il + V vs il y en a + indefinite pronoun + qui which appear as an alternative in subject or topic position specially in spoken spontaneous language. The factors governing the distribution of the indefinite in these different structures are listed. They include: syntactic structure (micro or macrosyntactic), type of indefinite pronoun, semantic features of the verbal predicate, discourse factors, register. It is shown that the presentative verbs function as “stabilizers” freeing the use of indefinite pronouns from many microsyntactic constraints

OpenEdition

Identifier et caractériser un genre: l'exemple des interviews politiques

Author: Blasco-Dulbecco Mylène
Cappeau Paul
Publication venue: 'CAIRN'
Publication date: 01/01/2012
Field of study

International audienceHow to identify and define a genre : the example of political interviews This paper proposes a reflexion on the notion of genre in the context of linguistics based on a specific corpus and on the new tools this methodological approach offers. The corpus under examination is made of the oral language developed by politicians through interviews. The aim of this study is to show how to analyze such a corpus and how to answer the methodological questions raised by the seeking of the relevant morpho-syntaxical criteria which would then allow us to define a genre. From a comparative study between the initial corpus and others (oral and written), we propose to determine the linguistic factual elements directly related to some parts of these verbal productionsCet article propose une réflexion sur la notion de genre dans le cadre de la linguistique sur corpus et sur la méthodologie que ces nouveaux outils permettent d'envisager. Notre réflexion prend appui sur la langue orale des hommes politiques, à travers des interviews. Notre objectif est de montrer comment peut être abordé un tel corpus et quelles questions de méthode soulève la recherche de critères morphosyntaxiques pertinents pour cerner un genre. A partir d'une étude comparative entre le corpus initial et d'autres corpus (de langue orale et écrite), nous proposons de mieux cerner les faits de langue qui relèvent de certaines des composantes de ces productions

HAL Clermont Université

Réflexions sur les exploitations différenciées de la grammaire

Author: Benzitoun Christophe
Cappeau Paul
Corminboeuf Gilles
Publication venue: 'OpenEdition'
Publication date: 17/07/2018
Field of study

Notre réflexion porte sur deux aspects en particulier : (i) les questions d’ordre méthodologique qui ont trait à la constitution des données (sélection des sous-corpus, taille de ceux-ci, hiérarchie entre eux, etc.), et (ii) l’intégration dans les ouvrages de référence des rendements multiples de la grammaire.En nous fondant sur des corpus diversifiés, nous présentons les résultats de deux analyses linguistiques, l’une sur l’unité lexicale justement et l’autre sur la construction syntaxique du type il y en a (beaucoup) qui dansent. Ces deux objets d’étude illustrent pour l’un des disparités importantes selon les corpus et pour l’autre une différence notable entre oral et écrit. En parallèle, nous comparons nos analyses sur corpus avec les traitements proposés dans des grammaires et des dictionnaires.Notre recherche souligne qu’une description linguistique qui prend en compte l’oral non formel donne des résultats parfois assez différents de ce que l’on observe à l’écrit, et qu’il y a par conséquent lieu de faire une place de choix au français parlé non planifié dans les ouvrages de référence. Si l’étude fait ressortir la spécificité de l’oral non formel, elle ne remet toutefois pas en question l’unité du système. Les phénomènes variationnels que nous avons observés ne nous conduisent pas à formuler une hypothèse de type diglossique ou dialectale, mais plutôt à adopter une conception polylectale de la grammaireOur reflection focuses on two aspects: (i) methodological issues related to the creation of data (selection of sub-corpora, size of sub-corpora, hierarchy among them, etc.), and (ii) the integration of the multiple usages of the grammar in reference books.Based on diversified French corpora, we présent the results of two linguistic analyzes - one on the lexical unit justement (‘precisely’) and the other on syntactic constructions like il y en a (beaucoup) qui dansent (‘there are many people who dance’). These two subjects of research illustrate important disparities according to the corpus for one and a significant difference between spoken and written French for the other. In parallel, we compare our corpus studies with the corresponding items in both grammars and dictionaries.Our study emphasizes that a linguistic description taking into account non- formal oral gives results that are sometimes quite different from what is observed in writing. So it is necessary to integrate unplanned spoken French in the reference grammars. Even if our work highlights the specificity of non-formal oral, we think that the system is unique. The facts we have observed do not lead us to formulate a “diglossic” or “dialectal” hypothesis, but rather to adopt a “polylectal” conception of grammar

OpenEdition

Les incidences de quelques aspects de la transcription outillée

Author: Cappeau Paul
Gadet Françoise
Guerin Emmanuelle
Paternostro Roberto
Publication venue: 'OpenEdition'
Publication date: 18/11/2014
Field of study

Cet article s’inscrit dans une réflexion d’abord méthodologique (mais en montrant les conséquences théoriques des choix faits lors de ces étapes) sur les premiers moments du travail sur les corpus, à partir de la sélection d’un logiciel d’aide à la transcription. L’interrogation majeure porte sur les enjeux et les incidences de ce que chacun des logiciels, par ses fonctionnalités propres, donne à voir du fonctionnement de l’oral, ou au contraire ne permet pas de montrer. Avec quelques exemples provenant d’un corpus recueilli en région parisienne, autour du discours rapporté, des particules et des chevauchements, l’objectif ultime est de montrer que cette phase initiale engage déjà toute une conception de la langue.This article discusses some methodological considerations which need to be adressed during the initial stages of corpus analysis, when selecting a transcription tool, in order to highlight the theoretical consequences of the choices made at this stage. The main focus of the article concerns the issues and implications of what different software tools, by virtue of their specific characteristics, reveal or on the contrary do not reveal about the way speech operates. The analysis is illustrated by examples drawn from a corpus of Parisian French around features such as reported speech, particles, and overlaps. It is argued that this initial stage already involves an entire conception of the language

OpenEdition

Spoken Corpora Good Practice Guide 2006

Author: Baude Olivier
Blanche-Benveniste Claire
Calas Marie-France
Cappeau Paul
Cordereix Pascal
de Lamberterie Isabelle
Goury Laurence
Jacobson Michel
Marchello-Nizia Christiane
Mondada Lorenza
Publication venue: HAL CCSD
Publication date: 01/01/2010
Field of study

International audienceThere is currently a vast amount of fundamental or applied research, which is based on the exploitation of oral corpora (organized recorded collections of oral and multimodal language productions). Created as a result of linguists becoming aware of the importance to ensure the durability of sources and a diversified access to the oral documents they produce, this Guide to good practice mainly deals with “oral corpora”, created for and used by linguists. But the questions raised by the creation and documentary exploitation of these corpora can be found in numerous disciplines: ethnology, anthropology, sociology, psychology, demography, oral history notably use oral surveys, testimonies, interviews, life stories. Based on a linguistic approach, this Guide also touches on the preoccupations of other researchers who use oral corpora (for example in the field of speech synthesis and recognition), even if their specific needs aren’t consistently dealt with in the present document

Corpus oraux, Guide des bonnes pratiques 2006. Version allemande

Author: Baude Olivier
Blanche-Benveniste Claire
Calas Marie-France
Cappeau Paul
Cordereix Pascal
De Lamberterie Isabelle
Goury Laurence
Jacobson Michel
Marchello-Nizia Christiane
Mondada Lorenza
Publication venue: HAL CCSD
Publication date: 01/01/2010
Field of study

International audienceViele Grundlagen- oder angewandte Forschungen beruhen zur Zeit auf der Auswertung von „Korpora der gesprochenen Sprache“ (geordneten Sammelwerken der Aufnahmen von mündlichen und multimodalen sprachlichen Produktionen). Dieses Handbuch der guten Praktiken entsteht aus der Erkenntnis von Sprachwissenschaftlern, die darauf bedacht sind, den Fortbestand der Quellen und einen verschiedenartigen Zugang zu den mündlichen von ihnen produzierten Produktionen zu sichern ; es schneidet zuerst die „Korpora der gesprochenen Sprache“ an, die von Sprachwissenschaftlern und für sie geschaffen und verwendet wurden. Die durch die Erschaffung und die dokumentarische Auswertung dieser Korpora hervorgerufenen Fragen trifft man aber in vielen Fächern : die Völkerkunde, die Anthropologie, die Soziologie, die Psychologie, die Demographie, die mündlich überlieferte Geschichte gebrauchen vor allem die verbale Befragung, die Aussage, das Interview, die Lebensgeschichte. Dieses Handbuch beruft sich auf das Verfahren der Sprachwissenschaftler, es stimmt aber mit den Beschäftigungen anderer Forscher überein, die Korpora der gesprochenen Sprache (z. B. in Sprachsynthese und -entzifferung) gebrauchen, auch wenn ihre spezifischen Bedürfnisse im vorliegenden Dokument nicht systematisch angeschnitten werden